Two-Phase Biomedical NE Recognition based on SVMs

نویسندگان

  • Ki-Joong Lee
  • Young-Sook Hwang
  • Hae-Chang Rim
چکیده

Using SVMs for named entity recognition, we are often confronted with the multi-class problem. Larger as the number of classes is, more severe the multiclass problem is. Especially, one-vs-rest method is apt to drop the performance by generating severe unbalanced class distribution. In this study, to tackle the problem, we take a two-phase named entity recognition method based on SVMs and dictionary; at the first phase, we try to identify each entity by a SVM classifier and post-process the identified entities by a simple dictionary look-up; at the second phase, we try to classify the semantic class of the identified entity by SVMs. By dividing the task into two subtasks, i.e. the entity identification and the semantic classification, the unbalanced class distribution problem can be alleviated. Furthermore, we can select the features relevant to each task and take an alternative classification method according to the task. The experimental results on the GENIA corpus show that the proposed method is effective not only in the reduction of training cost but also in performance improvement: the identification performance is about 79.9(Fβ = 1), the semantic classification accuracy is about 66.5(Fβ = 1).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Disguised Face Recognition by Using Local Phase Quantization and Singular Value Decomposition

Disguised face recognition is a major challenge in the field of face recognition which has been taken less attention. Therefore, in this paper a disguised face recognition algorithm based on Local Phase Quantization (LPQ) method and Singular Value Decomposition (SVD) is presented which deals with two main challenges. The first challenge is when an individual intentionally alters the appearance ...

متن کامل

Discriminative named entity recognition of speech data using speech recognition confidence

This paper presents a method for the named entity recognition (NER) of speech data that uses automatic speech recognition (ASR) confidence as a feature that indicates whether each word is correctly recognized. An NER model is trained using ASR results with named entity (NE) labels to include an ASR confidence feature as well as corresponding transcriptions with NE labels. Experiments using supp...

متن کامل

Efficient Support Vector Classifiers for Named Entity Recognition

Named Entity (NE) recognition is a task in which proper nouns and numerical information are extracted from documents and are classified into categories such as person, organization, and date. It is a key technology of Information Extraction and Open-Domain Question Answering. First, we show that an NE recognizer based on Support Vector Machines (SVMs) gives better scores than conventional syste...

متن کامل

Comparison of SVMs in Number Plate Recognition

High accuracy and high speed are two key issues to consider in automatic number plate recognition (ANPR). In this paper, we construct a recognition method based on Support Vector Machines (SVMs) for ANPR. Firstly, we briefly review some knowledge of SVMs. Then, the number plate recognition algorithm is proposed. The algorithm starts from a collection of samples of characters. The characters in ...

متن کامل

A New Approach for Applying Support Vector Machines in Multiclass Problems Using Class Groupings and Truth Tables

The Support Vector Machines (SVMs) had been showing a high capability of complex hyperplane representation and great generalization power. These characteristics lead to the development of more compact and less computational complex methods than the One-versus-Rest (OvR) and One-versus-One (OvO) [1] classical methods in the application of SVMs in multiclass problems. This paper proposes a new me...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003